The multimodal multipulse excitation vocoder

نویسندگان

  • Takahiro Unno
  • Thomas P. Barnwell
  • Mark A. Clements
چکیده

This paper presents a new high-quality, variable-rate vocoder in which the average bit-rate is parametrically controllable. The new vocoder is intended for use with data-voice simultaneous channel (DVSC) applications, in which the speech data is transmitted simultaneously with video and other types of data. The vocoder presented in this paper achieves state-of-the-art quality at several different bit-rates between 5.5 Kbps and 10 Kbps. Further, it achieves this performance at acceptable levels of complexity and delay.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Maximum-take-precedence ACELP: a low complexity search method

The ACELP method makes use of multipulse structure to represent the excitation pulses of residual signal. With the purpose of computational complexity reduction, this paper provides the Maximum-TakePrecedence ACELP (MTP-ACELP) search method under the acceptable degradation in performance. Because the maximum of target signal is preferentially compensated, the degradation of performance would be...

متن کامل

Selection of excitation vectors for the CELP coders

In this paper, we investigate several algorithms that construct the input for the synthesis filter in the CELP coder, we present them under the same formalism, and we compare their performances. We model the excitation vector by a linear combination of K signals, which are issued from K codebooks and multiplied by K associated gains. We demonstrate that this generalized form incorporates severa...

متن کامل

A Glottal Vocoder Employing Vector Quantization

This paper describes a speech coder for low bit rates using a parametric representation of voiced excitation waveforms (Glottal ARX) and standard LPC for unvoiced. For efficient compression purposes the excitation and spectrum parameters are quantized with vector quantization (VQ). This has resulted in a glottal vocoder operating at 1320 bits/s and sounding more natural than a standard LPC voco...

متن کامل

Efficient multipulse approximation of speech excitation using the most singular manifold

We propose a novel approach to find the locations of the multipulse sequence that approximates the speech source excitation. This approach is based on the notion of Most Singular Manifold (MSM) which is associated to the set of less predictable events. The MSM is formed by identifying (directly from the speech waveform) multiscale singularities which may correspond to significant impulsive exci...

متن کامل

Multipulse Sequences for Residual Signal Modeling

In source-filter models of speech production, the residual signal what remains after passing the speech signal through the inverse filter contains important information for the generation of naturally sounding re-synthesized speech. Typically, the voiced regions of residual signals are regarded as a mixture of glottal pulse and noise. This paper introduces a novel approach to represent the nois...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1997